Improving speech synthesis for high intelligibility under adverse conditions
نویسندگان
چکیده
We investigate methods of improving the intelligibility of synthetic speech under noisy or low-fidelity acoustic conditions. Techniques explored improve speech in a natural manner, such that training won’t be required for the user to understand the enhanced speech. While the improvements are natural in this respect, the changes aren’t limited to creating only speech that is achievable by a human vocal tract. Modifications fall into three broad classes: increasing phoneme amplitude, altering spectral shape, and lengthening phoneme duration. Listening tests conducted in noisy and noise-free conditions demonstrate significant improvements to intelligibility for most of the subject phonemes.
منابع مشابه
Improving speech intelligibility in noise environments by spectral shaping and dynamic range compression
Speech produced under real conditions (not a recording studio, nor a quiet room) is not always equally intelligible due to the presence of background noise. This noise may mask part of the speech signal such that not all speech information is available to the listener. The ability to detect speech in noise plays a significant role in our communication with others. In this work we suggest the us...
متن کاملRephrasing-based speech intelligibility enhancement
Existing algorithms for improving speech intelligibility in a noisy environment generally focus on modifying the acoustic features of live, recorded or synthesized speech while preserving the phonetic composition (the message). In this paper, we present an algorithm for text-to-speech systems that operates at a higher level of abstraction, the message-level. We use a paraphrasing system to adju...
متن کاملSignal-to-noise ratio adaptive post-filtering method for intelligibility enhancement of telephone speech.
Post-filtering can be utilized to improve the quality and intelligibility of telephone speech. Previous studies have shown that energy reallocation with a high-pass type filter works effectively in improving the intelligibility of speech in difficult noise conditions. The present study introduces a signal-to-noise ratio adaptive post-filtering method that utilizes energy reallocation to transfe...
متن کاملAssessing the Intelligibility and Quality of HMM-based Speech Synthesis with a Variable Degree of Articulation
This paper focuses on the assessment of both the intelligibility and the quality of speech when using a variable degree of articulation (hypo/hyperarticulation) in the framework of HMM-based speech synthesis. Intelligibility is evaluated when the synthesizer is working in adverse conditions. The adaptation of a neutral speech synthesizer to generate hypo and hyperarticulated speech is first per...
متن کاملDetection of Acoustic Landmarks with High Resolution for Speech Processing
Earlier Investigations have shown that speech processing to incorporate certain acoustic characteristics of clear speech in conversational speech can improve its intelligibility under adverse listening conditions. This processing needs detection of acoustic landmarks, the important regions in speech containing cues for phoneme identification. This paper presents a method for landmark detection ...
متن کامل